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A-INTRODU  CTION 


RUSSO.  Jose 


Breast  cancer  is  a  hormone  dependent  malignancy  whose  incidence  is  steadily  increasing  in  most  western 
societies  and  in  countries  that  are  becoming  industrialized  (1-5).  Despite  significant  advances  in  early 
detection,  surgery,  radiation,  chemo-  and  endocrine  therapy,  the  mortality  caused  by  this  disease,  that  in 
the  United  States  is  only  second  to  lung  cancer  as  a  cause  of  cancer-related  death,  has  remained  almost 
unchanged  for  the  past  five  decades  (1).  New  studies  of  the  biochemical  mechanisms  evoked  by 
conventional  treatments  for  neoplastic  diseases  point  to  apoptosis  as  a  key  process  for  elimination  of 
unwanted  cells  (6).  Impaired  function  of  apoptosis-related  genes  is  deeply  involved  in  oncogenesis  and 
the  progression  of  cancers  (6-9).  Among  the  genes  that  control  apoptosis  is  Fas  (CD95/Apo-l)  a  cell 
membrane  receptor  that  upon  binding  by  its  ligand  (FasL)  triggers  a  signal  resulting  in  apoptotic  cell 
death  (8-10).  Fas  is  a  cell-surface  receptor  that  exists  in  two  forms,  transmembrane  and  soluble.  The 
former  induces  apoptosis  by  ligation  of  FasL  or  agonistic  anti-Fas  antibody,  whereas  the  latter  inhibits 
Fas-mediated  apoptosis  by  neutralizing  its  ligand  (11,  12).  Harnessing  the  power  of  this  complex 
molecule  is  expected  to  lead  to  the  development  of  powerful  chemotherapeutic  advances.  Our  Laboratory 
has  recently  found  a  link  between  the  Fas  ligand-receptor  complex  in  chemically  transformed  human 
breast  epithelial  cells  and  a  gene/s  located  in  chromosome  17pl3.2pl3.1  (13).  Whereas,  gene  transfer  of 
Fas  ligand  (CD95L)  using  adenoviral  vectors  has  been  shown  to  generate  apoptotic  responses  and  potent 
inflammatory  reactions  that  can  be  used  to  induce  the  regression  of  malignancies  in  vivo,  there  are 
significant  unwanted  reactions  such  as  hepatotoxicity  that  may  limit  their  clinical  utility  (7,10).  Therefore 
what  is  needed  is  to  identify  which  is  the  gene  or  genes  that  may  regulate  Fas  ligand-receptor  complex 
(12,14-18).  For  this  purpose  we  have  proposed  the  following  aims:  1)  To  isolate  in  the  precise  location 
in  chromosome  17pl3.2-pl3.1  the  gene(s)  responsible  for  the  control  of  Fas-receptor  ligand  complex 
function,  2)  To  test  the  cloned  gene/s  functionality  in  cell  transformation  by  cDNA  transfection 
studies  in  vitro,  and  3)  To  determine  the  functional  role  of  the  isolated  gene  in  the  process  of 
neoplastic  progression  in  vivo. 

B-BODY 

B-i~The  experimental  system.  We  have  developed  an  in  vitro  -  in  vivo  system  in  which  the 
environmental  carcinogen  benz(a)pyrene  (BP)  has  been  utilized  for  inducing  in  vitro  the  transformation  of 
human  breast  epithelial  cells  (HBEC)  (19-32).  For  developing  this  paradigm  of  human  breast  cancer  we 
have  capitalized  in  the  availability  of  the  mortal  HBEC-MCF-IOM,  which  without  viral  infection,  cellular 
oncogene  transfection,  or  exposure  to  carcinogens  or  radiation  became  spontaneously  immortalized, 
originating  the  cell  line  MCF-I0F  (33,34).  Treatment  of  MCF-10M  cells  with  chemical  carcinogens 
failed  in  inducing  cell  transformation,  while  MCF-10F  cells  responded  to  in  vitro  treatment  with  BP  with 
the  expression,  in  a  progressive  fashion,  of  all  the  phenotypes  indicative  of  neoplastic  transformation. 
BP-treated  MCF-  IOF  cells  expressed  increased  survival  and  formation  of  colonies  in  agar  methocel,  loss 
of  ductulogenic  properties  in  collagen  matrix,  invasiveness  in  a  Matrigel  in  vitro  system  (clones  BP-1)  and 
tumorigenesis  in  severe  combined  immunodeficient  (SCID)  mice  (BPI-E)  (19,  22,  27).  We  have 
demonstrated,  by  utilizing  DNA  amplification  of  micro  satellite  length  polymorphism,  allelic  losses  and 
microsatellite  instability  in  different  areas  of  the  genome  of  the  HBEC  transformed  by  chemical 
carcinogens  (24,  25,  34-37,  42,  43).  MSI  was  detected  in  chromosomes  11  and  17  in  the  immortalized 
cells,  and  additional  MSI  in  chromosomes  11,13  and  17  in  the  transformed  cells.  LOH  was  detected  only 
in  one  locus  of  chromosome  1 7pl 3.2-pl 3. 1  in  the  transformed  cells  BPI-E  using  the  marker  D17S796 
(13)  (Figure  1).  We  have  been  a  unable  to  identify  any  other  genetic  alterations  in  more  than  40  other 
markers,  including  p53,  in  the  p  or  q  arm  of  chromosome  17  (13).  Because  chromosomes  1 1  and  17  were 
involved  in  both  the  early  and  late  stages  of  carcinogenesis  we  selected  them  for  testing  their  functional 
roles  in  chemically  transformed  HBEC  using  a  microcell-mediated  chromosome  transfer  (MMCT) 
technique  (35-38).  Our  study  found  that  seven  out  of  ten  clones  with  chromosome  17  transferred  in  to 
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BP  IE  cells  had  reverted  transformed  phenotypes  such  as  advantageous  in  growth,  colony  formation  in 
agar-methocel,  loss  of  ductulogenesis  and  resistant  to  Fas  mediated  apoptosis  (13).  Our  molecular 
analysis  of  MCF10F  and  BP1E  cells  reveled  that  in  the  process  of  chemical  transformation  loss  of  genetic 
material  (LOH)  in  the  chromosome  17  pi 3.2  (D17S796),  which  has  been  identified  by  other  investigators 
in  atypical  ductal  hyperplasia  and  in  situ  ductal  carcinoma  of  the  breast  (39),  has  been  restored  in  the  cells 
with  abrogated  transformation  phenotype  (Figure  1). 


Figure  1:  LOH  in  BP1E  using  the  marker 
D17S796  and  restoration  of  this  region  in 
BP1E-1 7  neo.  Single-stranded  conformation 
polymorphism  (SSCP)  analyses  were  done 
using  DNA  from  MCF-10F  (line  1 ),  BP  IE 
(line  2)  and  BP1E-1 7  neo  clone  II-3  (line  3). 
PCR  reactions  were  performed  in  a  total 
volume  of  10  pi  containing:  IX  buffer  with 
MgCl2  (Perkin  Elmer ),  400  pM  of  each 
dATP,  dTTP,  dCTP  and  dGTP,  200  pM  of 
the  primer  Dl  7S796  Reverse ,  200  pM  of  the 
primer  D17S796  Forward,  0.2  U  Taq 
Polymerase  (Perkin  Elmer),  1  pCi  f2P-a] 
dATP  and  120  ng  total  DNA.  The  thermal 
cycling  consisted  on:  94°C,  2  min  for 
denaturalization,  and  35  cycles  of  94°C,  45 
sec;  56°C,  45  sec  and  72°C,  45  sec  followed 
by  an  extension  at  72°C,  10  min.  PCR 
products  were  run  on  6%  polyacrylamide  gel 
and  autoradiographied. 


This  indicates  that  the  LOH  in  this  region  may  regulate  the  early  event  of  breast  cancer  initiation  and  that 
the  region  of  D17S796  (7586bp)  might  contains  gene/s  controlling  the  transformation  phenotypes.  More 
importantly  it  was  observed  that  in  those  clones  in  which  the  reversions  of  the  phenotype  were  present 
also  the  cells,  loss  the  resistance  to  Fas  mediated  apoptosis.  Transfer  of  chromosome  1 1  neither  revert  the 
transformation  phenotypes  or  the  resistance  to  the  Fas  mediated  apoptosis.  These  data  allowed  us  to 
postulate  that  chromosome  17  pl3.2  (marker  D17S96)  might  contain  a  gene  or  genes  that  are  controlling 
this  process. 


B-ii-. Progress  Report. 

In  this  report  we  cover  the  work  performed  from  July  1,  2002  to  March  30,  2003,  covering  the  statement 
of  work  lthat  is  to  isolate  in  the  precise  location  in  chromosome  17pl3.2  the  gene  (s)  responsible  for  the 
control  of  Fas-receptor  ligand  complex  function. 

B-ii-a-  Methods  and  procedures. 

DNA  isolation:  DNA  was  prepared  from  MCF-10F,  BP  IE  and  BP  IE- 17  neo  (clone  II-3).  All  the  cells 
cultures  were  treated  with  lysis  buffer  (100  mM  NaCl,  20mM  Tris-Cl  pH  8.0,  25  mM  EDTA  pH  8.0, 
0.5%  SDS)  with  200  |ig/ml  proteinase  K  and  incubated  at  65°C,  15  minutes  with  gentle  agitation.  The 
samples  were  cooled  down  on  ice  and  treated  with  lOOpg/ml  RNAse  at  37°C,  30  minutes.  The  samples 
were  purified  by  extracting  with  1  volume  phenol  followed  by  an  extraction  with  1  volume  of  chloroform: 
isoamyl  alcohol  (24:1).  The  aqueous  layer  was  adjusted  to  0.75M  with  ammonium  acetate  and  the  DNA 
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was  precipitated  with  2  volumes  of  100%  ethanol.  The  samples  were  centrifuged,  dried  and  dissolved  in 
distilled  water. 

PCR  amplification:  PCR  amplification  was  performed  using  the  High  Fidelity  PCR  system  (Roche).  The 
primers  used  to  amplify  were:  Primer  JR  853  Reverse  5’  CTCTCTTAGGATTTACCTTCCC  3’  and 
Primer  JR  864  Forward  5’  T CTTT CT GCT GAT GGCT CTCC  3’.  The  genomic  DNA  from  MCF-10F, 
BP  IE  and  BP  IE- 17  neo  (clone  II-3)  were  use  as  templates  in  different  PCR  reactions.  The  PCR  was 
performed  in  a  lOOpl  volume  containing  100  ng  of  genomic  DNA,  IX  buffer  with  MgCL,  200  pM  of 
each  dATP,  dCTP,  dGTP,  and  dTTP,  300  nM  of  each  primer,  and  5U  Expand  High  Fidelity  enzyme 
(Roche).  The  PCR  conditions  were:  denaturalization  at  94°C,  2  min  followed  by  30  cycles  at  94°C  for  15 
sec,  60°C  for  30  sec  and  72°C  for  50  sec,  followed  for  72°C,  10  minutes  for  extension.  The  10  minutes 
extension  is  important  to  ensure  that  all  PCR  products  are  full  length  and  3’adenylated  (A). 

Cloning  of  the  PCR  products:  The  PCR  products  were  cloned  in  the  vector  pCR2.1  using  the  TOPO  TA 
cloning  kit  (Invitrogen).  The  linearized  vector  supplied  has  a  single,  overhanging  3’  ends  deoxythymidine 
(T)  residues.  This  allows  PCR  inserts  3’  adenylated  to  ligate  efficiently  with  the  vector.  The  ligations 
were  used  to  transform  Escherichia  coli  TOP10  (Invitrogen)  using  the  chemical  transformation  method. 
The  transformants  were  plated  on  LB  agar  with  50pg/ml  kanamycin  and  X-gal  and  the  plates  were 
incubated  at  37°C  during  24h.  The  plasmid  pCR  2.1  has  a  fragment  of  the  lacZ  gene  and  the  cloning  site 
is  within  this  region.  The  peptide  LacZ  produces  a  blue  color  when  the  bacteria  are  plate  on  X-Gal.  The 
LacZ  peptide  will  not  be  present  and  the  colonies  remains  white  if  the  lacZ  gene  has  been  interrupted  by 
the  cloned  insert. 

Plasmid  preparation  and  Sequencing:  Minipreparations  of  the  plasmid  DNA  were  made  using  the 
alkaline  lysis  method  (48).  The  pellet  was  resuspended  in  100  pi  of  distilled  water  and  treated  with 
RNAase  (20  pg/ml)  during  30  minutes  at  37°C.  One  volume  of  PEG/NaCl  (13%  6-8K  PEG,  1.6M  NaCl) 
was  added  and  incubated  on  ice  30  minutes.  The  samples  were  centrifuged  30  minutes  at  4°C.  The  pellet 
was  resuspended  in  50  pi  of  distilled  water.  The  sequencing  reactions  were  done  at  the  Fox  Chase  Cancer 
Center  Facility.  The  Ml 3  Reverse  and  Ml  3  Forward  primers  were  used  for  sequencing.  The  Sequencher 
and  GCG  programs  were  use  for  the  sequences  analysis. 

RT-PCR:  RNA  was  extracted  from  MCF-10F,  BP  IE  and  BPlE-17neo  using  Trizol  reagent  (Life 
Technologies,  Inc.)  and  the  pellet  was  resuspended  in  DEPC  water.  Approximately,  4  pg  of  RNA  was 
treated  with  4  U  DNase  I  (Ambion)  during  30min  at  37°C  followed  by  the  addition  of  5  pi  of  Dnase 
Inactivation  reagent  (Ambion)  according  to  manufacture  protocol.  Another  step  of  DNase  I  inactivation 
was  made  by  incubation  at  80°C,  10  minutes.  The  sample  was  divided  in  two  tubes  with  2  pg  of  RNA 
each.  One  of  these  RNA  samples  was  used  to  prepared  cDNA  and  the  other  was  used  to  test 
contamination  with  DNA  (negative  control).  For  the  cDNA  synthesis,  2  pg  of  RNA  was  mix  with  0.4  pg 
oligo  (dT)  12-18  in  a  final  volume  of  9.6  pi  and  incubated  at  70°C  during  10  min  and  cool  down  on  ice. 
The  following  reagents  were  added:  3.2  pi  5X  First  Strand  buffer,  0.8  pi  lOmM  dNTP  and  1.6  pi  0.1M 
DTT.  After  incubation  during  5  min  at  42°C,  160U  Superscript  II  Reverse  Transcriptase  (Invitrogen)  was 
added.  The  reaction  mix  was  incubated  at  42°C,  1  h.  The  enzyme  was  inactivated  at  70°C,  15  min.  The 
reaction  mix  was  cool  down  on  ice  and  1.6U  of  E.  coli  RNase  H  was  added  and  incubated  at  37°C,  15 
min.  After  the  cDNA  synthesis,  the  PCR  reactions  were  done. 

The  primers  JR869  Reverse  and  JR870  Forward  were  used  and  a  PCR  fragment  of  340  bp  was  expected. 
In  other  PCR  reactions,  the  primers  JR  867  Reverse  and  JR  868  Forward  were  used  and  the  expected 
product  was  273-285  bp.  The  conditions  for  both  PCR  were  the  same.  The  PCR  was  performed  in  a  50 
pi  volume  containing  50  ng  of  DNA,  IX  buffer  with  1.5  Mm  MgCL,  200  pM  of  each  dATP,  dCTP, 
dGTP,  and  dTTP,  200  nM  of  each  primer,  and  5  U  of  Platinum  Taq  (Invitrogen).  The  PCR  conditions 
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were:  denaturalization  at  94°C,  4  min  and  35  cycles  at  94°C  for  45  sec,  56°C  for  45  sec  and  72°C  for  45 
sec,  following  an  extension  at  72°C,  10  minutes.  The  sequences  of  the  primers  are:  Primer  JR  869 
Reverse:  5’  CTG  CTC  CAG  GAT  TTC  AAG  GAC  C  3’,  Primer  JR870  Forward:  5’  GCT  GAT  GGC 
TCT  CCT  ATG  ACC  3’,  Primer  867  Reverse:  5’  CTT  GGG  GTA  TAC  ATG  CAC  CTG  3’  and  Primer 
868  Forward:  5’  TCT  CCC  TGT  GAG  CAT  AGG  GTT  GAC  3’.  RNA  treated  with  DNase  I  (without 
reverse  transcription)  was  used  in  the  PCR  reactions  to  test  for  DNA  contamination  (negative  controls). 


B-ii-b-  Results. 

Cloning  of  D17S796  region. 

It  has  been  reported  that  the  frequent  LOH  of  a  specific  chromosomal  marker  is  indicative  of  a  closely 
linked  tumor  suppressor  gene  (TSG)  (49,  50).  In  previous  experiments  we  have  shown  LOH  in  the  locus 
1 7pl  3.2  using  the  micro  satellite  marker  D17S796  in  BP1E  cells  (Figure  1),  and  a  detailed  analysis  of  this 
region  using  Map  Viewer  (http://  www.ncbi.nlm.nih.gov/  mapview/map  search.cgi)  has  shown  the  KIAA0523 
gene  of  approximately  200  Kb  upstream  and  the  AIPL1  (aryl  hydrocarbon  receptor  interacting  protein¬ 
like  1)  at  78  Kb  downstream  of  this  marker.  Therefore,  we  pursued  the  search  for  a  novel  putative  TSG  in 
this  region,  which  is  approximately  940  bp.  We  have  amplified  this  region  by  PCR  from  MCF-10F,  BP  IE 
and  BP  IE- 17  neo  (clone  II-3),  using  the  primers  JR  853  and  JR864.  The  sequences  of  these  primers  were 
obtained  based  on  the  sequence  from  the  Genebank  AC005668  (Figure  2). 

JR  853  ^ 


126421 

aatgtttttt 

gattcaataa 

actctgtttt 

tcagctttga 

tgctactggt 

gtaactactc 

126481 

tcttaggatt 

- ► 

taccttccct 

gaggggcagc 

ctggtctaaa 

agtcactcca 

ttgtgcactg 

126541 

gatgaagggc 

ccaggaggtg 

agtgagaaca 

tcaaagtggg 

agtggggcca 

ggcaggtgcc 

126601 

actcttggca 

gcttgctctc 

cccagctctc 

ctaaagccag 

ggtcagagca 

tgagccccgg 

126661 

atgggagggg 

catcatcctg 

gacctgatca 

ggcctagaag 

agcttgcgtt 

ctcaaccgag 

126721 

gctggtcttt 

accaagatca 

tccccaacga 

cctgccctcc 

ccagccatct 

ttcctcccat 

126781 

atgctacacc 

ctacatgtct 

tgcaaagctg 

atgtcccctg 

tggctccttg 

actgttcatg 

126841 

tctctctgac 

ctctcctttc 

tcagaccatg 

ggtatgtgag 

cagctaggat 

caaggggcat 
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126901 

gagatttagt 

cctgacgaca 

tgatcattcc 

atgtgtgaat 

ctggtttgcc 

cctttagtcc 

126961 

gataatgcca 

- ► 

ggatgtgcct 

ctttcacagt 

agcctttatg 

ctctcctgag 

tacttggggt 

127021 

atacatgcac 

ctgtgtgtgt 

gtgtgtgtgt 

gtgtgtgtgt 

gtgtgtgtta 

aaaaatactg 

127081 

ctccaggatt 

tcaacjgacca 

catttggttc 

cattgttttc 

tttaattgtg 

gcaagctaag 

D17S796  Forw 

127141 

aagaaaagaa 

ctgcaatgcg 

catgctctgt 

agatggacat 

gcttcccatg 

gtcattacac 

127201 

cagaggagag 

tcgggcacag 

ctcctccccc 

tgccaggctc 

ccaagatcca 

tgctccaatg 

127261 

tcaaccctat 

gctcacaggg 

agagccttga 

tttaagtttg 

gtaaaacgat 

atcccattaa 

127321 

atttatgttg 

ttaatttgaa 

ttttattgat 

ttcgttgttt 

gccgttctat 

aatttgtcca 

127381 

tttgttttgc 

ttttatggtc 

ataggagagc 

catcagcaga 

aagattttac 

tccagcttgc 

◄ - 

JR864 

127441 

atgttggtac 

atatttaaat 

agcattataa 

tagaaattag 

ttgtgtcaac 

aaagagggag 

127501 

aagttctttg 

aaagtatttt 

catttcaaaa 

gaagagccaa 

gcattgctag 

gatttgagaa 

127561 

atattttatt 

taatgttgta 

tttcccacga 

gaggatcatg 

agcgctcaac 

tcttcccaac 

-7- 


127621  cccagctgac  tcctccatga 
127681  gaaattcaaa  gatatgtagt 
127741  ggaagaatct  gggacccagc 
127801  ctggaaactt  gcctgagaaa 
127861  acatgagctg  tgaaagacca 
127921  gggagtgctg  gcatttctct 
127981  tttttctttc  tttgtcccct 
128041  ttagc  128046 


gaaatctgtc  agtggttttc 
gtgttaaggc  cagggtgttg 
ttctgcccag  ttacaaatga 
ccatctgagg  acgtcattgt 
caggacagat  gcagaggact 
ctatccctga  tgccagcaaa 
acgagaagaa  gatatggaga 
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ccagaaggct  gcccccgtgt 
ctctccgaat  gcctccacca 
atgggacaaa  gtcatgttca 
gcgtccatcg  ttccgagaaa 
cccctaaatt  atctacccca 
tgcagccacg  cccaagagat 
ctatgtggtt  attaaagctc 


Figure  2:  Nucleotide  sequence  of  the  region  D17S796  from  the  GeneBank  AC005668.  Part  of  the  nucleotide  sequence  of 
the  GeneBank  AC005668  (Homo  sapiens  chromosome  17,  clone  hRPK.243-K-12),  basesl26421  to  127440,  is  shown .  The 
position  of  the  primers  D1 7S796  Reverse  and  D1 7S796  Forward  used  in  the  SSCP  analyses  are  underline.  Also,  the  position 
of  the  primers  JR853  and  JR864  used  in  the  PCR  amplification  are  shown. 


The  940bp  region  was  amplified  from  using  a  high  fidelity  Taq  polymerase  (Figure  3).  The  PCR  products 
were  cloned  in  the  vector  pCR2.1  and  the  mix  was  used  to  transform  E.  coli  TOP  10  (Invitrogen).  The 
plates  were  incubated  at  37°C  during  24  h  for  detecting  the  growth  of  blue  and  white  colonies  (Figure  4). 
Ten  white  colonies  were  picked  up  and  grown  on  LB  with  kanamycin. 
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Figure  3:  PCR  products 
obtained  from  MCF-10F,  BP1E 
and  BPIE-l  7  neo  clone  II-3. 
The  primers  JR853  and  JR864 
were  used  to  amplified  940  bp 
region  contained  D1 7S796. 
The  total  DNA  from  MCF10-F, 
BP1E  and  BPIE-l  7  neo  (clone 
11-3)  were  used  as  template  in 
three  different  PCR  reactions. 
Approximately,  10  pi  of  each 
PCR  reaction  was  run  on  1.5% 
agarose  gel.  The  940  PCR 
products  are  shown.  PCR 
reactions  without  DNA  were 
also  made  as  control  (negative 
control).  A  100  bp  ladder  was 
run  in  both  gels  as  molecular 
weigh  marker. 


Figure  4:  White  and  blue  colonies  growth  on  LB  agar  with  kanamycin  and 
X-gal.  The  PCR  products  obtained  were  cloned  in  the  vector  pCR2.1.  The 
plasmids  were  used  to  transform  E.  coli  TOP  10.  White  and  blue  colonies 
growth  after  24 h  incubation  at  37°C.  This  plate  shown  the  colonies  obtained 
using  the  PCR  product  amplified  from  BP1E.  White  colonies  harbored 
plasmids  with  the  940bp  insert.  Blue  colonies  harbored  pCR2.1  without 
insert.  Ten  white  colonies  were  picked  up  for  further  analyses. 
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Sequence  analyses. 


The  plasmids  were  prepared  using  the  alkaline  lysis  method  followed  by  PEG/NaCl  precipitation.  The 
inserts  were  sequenced  at  the  Fox  Chase  Cancer  Facility  using  Ml 3  Reverse  and  Forward  primers.  The 
sequences  from  the  different  clones  were  compared  using  Sequencher  and  GCG  programs.  We  found 
differences  between  MCF-10F,  BP  IE- 17  neo  and  BP  IE  in  a  region  of  160-174  bp  that  corresponds  to  the 
marker  D17S796  that  contains  a  zone  of  TG  repetition.  Using  the  multiple  sequence  alignment,  we  found 
that  the  sequences  from  BP1E  differ  with  MCF-10F  and  BPlE-17neo  basically  in  the  number  of  TG 
repetition  that  lies  between  the  primers  D17S796  Reverse  and  D17S796  Forward  (Figure  5).  In  the 
Figure  5,  the  alignment  between  the  sequences  from  BP1E  and  MCF-10F/BP1E-I7neo  are  shown. 
Approximately,  530  bp  that  include  D17S796  region  is  shown.  The  only  missing  bases  were  found  in  this 
region. 


AAGGGGCATCAGAnTAGTCCTGACGACATOATCATTCCATGTGTCAATC 

1 1 1 II 1 1 1  1 1 1 II 1 1 1 1 II II 1 1 1 1  II  I II I  III  1 1 II 1 1 1 1 1 1 II  1 1 1 1 

UGGGGCATGAGAnTAGTCCTGACGACATOATCATTCCATGTCTCAATC 
M7S796  Reverse 


Figure  5:  Sequences  alignment 
between  BplE  and  MCF-10F 
and  BP1E-1 7  neo. 
Approximately,  550bp 

nucleotide  sequence  from  BP1E 
with  19  TG  repetitions  is  shown 
on  the  top.  The  nucleotide 
sequence  from  MCF-10F  and 
BPlE-17neo  with  25  TG 
repetitions  is  shown  at  the 
bottom.  The  positions  of  the 
primers  D17S796  Reverse  and 
D1 7S796  Forward  are 

underline. 

Sequences  with  different 

number  of  TG  repetitions 

were  found  in  the  same  cell 
line.  In  MCF-10F,  five 
alleles  were  found  (Figure 
6).  One  colony  from  9 
(1/9)  shown  18  TG 

repetition;  three  colonies 
(3/9)  shown  20  TG 

repetitions;  one  colony 
(1/9)  shown  21  TG 

repetition;  two  colonies 

(2/9)  shown  24  and  two  others  (2/9)  shown  25  TG  repetitions  (Figure  6).  In  BP1E,  two  different  alleles 
were  found  (Figure  6).  Five  out  of  7  colonies  analyzed  (5/7)  shown  20  TG  repetitions  and  two  (2/7) 
shown  19  TG  repetitions  (Figure  6).  In  BP1E-17  neo,  five  alleles  were  found  (Figure  6).  Two  out  of  10 

analyzed  colonies  (2/10)  shown  19  TG  repetitions  and  2  (2/10)  shown  20  TG  repetitions.  Three  colonies 

out  of  10  analyzed  (3/10)  showed  22  TG  repetitions,  one  (1/10)  shown  23  and  two  (2/10)  shown  25  TG 
repetitions  (Figure  6). 


TGGTTTGCCCCTTTAGTCC  GATAATGCC  AGGATGTGCC  TCTTTC  AC  AGT  A 

1 1 1 1 1 1 1 1 1 1 m 1 1 1 1 1 tit 1 1 ii  1 1 1 1 1 1 1 1  i  1 1 1 1  i  i  ii  1 1 1 1 1 1 1 1 

TGGTTTGCCCCTTTAGTCCGATAATGCCAGGATGTGCCTCnTCACAGTA 

GC  CTTTATGCTCTC  CTG AGTAC  TT GGGGTAT ACATGCAC  C . 

I  III  llllllll  IM1IIII  III  II  I  Mill  I  I  I  I  II  1 II 
GCCTTTATGCTCTCCTGAGTACTTGGGGTATACATGCACCTGTGTGTGTG 

. .  TGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTCTGTTAAAAAATA 
I  1 1  II I  1 1  I  1 1  I  I  I  I  I  II  I  I  I  I  I  1 1  I  1 1  I  I  1 1  I  I  1 1  I  1 1  I  I  I  II  II I  I 
TCTGTGTGTCTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTTAAAAAATA 

CTGCTCC AGGATTTCAAGGAC  C  AC  ATTTGGTTCCATTGTTTTCTTT AATT 

I  I  I  1 1  I  1 1  1 1  I  1 1  I  I  I  I  II  1 1  I  I  I  I  1 1  I  1 1  I  III  I  I  1 1  I  1 1  I  I  II  I  I  1 1  I 

TTTTCTTTAATT 

D17S796  Forward 

GTGGCAAGCTAAGAAGAAAAGAACTGCAATGCGCATGCTCTGTAGATGGA 
1 1 1 1 1 1 1 1 1 1 1 1 !  1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 

GTGGCAAGCTAAGAAGAAAAGAACTGCAATGCCCATGCTCTGTAGATGGA 
CATGCTTCCC  ATGGTC  ATT  AC  ACC  AGAGGAGAGTGGGGC  AC  AGC  TCC7CC 

iiiiiiiiiiiiiiiiimiiiijmiiiiiiiiiiiiiiiiiiiiii 

CATGC  TTC  CC  ATGGTC  ATT AC  ACC  AGAGGAGAGTCGGGC  AC  AGC7CCTCC 
CCCTGC  C  AGGC  TC  C  CAAGA  ACC  ATGCTCCAATGTC  A  ACCCTATGCTC  AC  A 

iiiiimiiiiiMiiiiiiiiiiiiiiiiimiMiiiiiiiiiiii 

CCCTGCCAGGCTCCCAAGAACCATGCTCCAATGTCAACCCTATGCTCACA 

GGGAGAGCCTTGATTTAAGTTTGGTAAAACGATATCCCATTAAATTTATG 
I  I  I  1 1  I  II  1 1  I  I  I  I  1 1  I  1 1  1 1  1  1 1  i  1 1  I  1 1  1 1 1 1 1  I  I  1  1 1 1  I  1 1  1 1  I  1 1  I 
GGGAGAGCCTTGATTTAAGTTTGGTAAAACGATATCCCATTAAATTTATG 

TTGTTAATTTGAATTTTATTGATTTCGTTGTTTGCCGTTCTATAATTTGT 
I E I  1 1  I  1 1  1 1  I  1 1  I  1 1  I  1 1 1 1  I  I  I  II  I  I  1 1  I  1 1 1 1 1  1 1  I  1 1  1 1 1  II  I  1 1  I 

TTGTTAATTTGAATTTTATTGAnTCGTTGTTTGCCGTTCTATAATTTGT 


CCATTTGTTTTGCTTTTATGGTCATAGGAGAGCCATCAGCAGAAAGAAAG 
III  llllllllllllllllllllllllllllllllllllllllllllll 
CC  ACTTGTTTTGC  TTTTATGGTCAT  AGGAGAGCCATCAGCAG  AAAG  AAAG 


Our  PCR-SSCP  analysis  of  MCF10F  and  BP  IE  cells  reveled  that  in  the  process  of  chemical 
transformation  loss  of  genetic  material  (LOH)  in  the  chromosome  17  pl3.2-pl3.1  (D17S796),  has  been 
restored  in  the  cells  with  abrogated  transformation  phenotype.  The  LOH  found  in  BP  IE  using  PCR- 
SSCP  (Figure  1)  is  explained  by  the  lost  of  the  bigger  PCR  products.  The  expected  sizes  of  the  PCR 
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products  using  D17S796  Reverse  and  D17S796  Forward  are  indicated  in  each  case  based  on  the 
sequences  obtained  (Figure  6).  According  to  our  previous  results,  one  of  these  fragments,  present  in 
MCF-10F  and  BP  IE- 17  neo  and  absent  in  BP1  could  be  responsible  for  the  reversion  of  the  transformed 
phenotype  of  BP1E-17  neo.  We  found  that  BP1E  has  one  allele  with  (TG)i9  and  the  other  allele  with 
(TG)2o  (Figure  6).  The  same  fragments  were  also  found  in  BP  IE- 17  neo.  From  the  others  three  alleles 
present  in  BP1E-17  neo,  only  the  one  with  25  TG  repetitions  was  also  found  in  MCF-10F.  It  is  possible 
that  this  fragment  was  responsible  for  the  reversion  of  the  phenotype  of  BP1E-17  neo. 


MCF-10F: 

Number  of 

colonies 

Expected  size  of  the 

D17S796  fragment 

.. 

. CACC  (TG)  w  TT  AAA  AAA . 

V9 

160  bp 

. CACC  (TG)  20  TT  AAAAAA . 

3/9 

164  bp 

. CACC  (TG)  21  TT  AAAAAA . 

1/9 

166  bp 

. CACC  (TG)  24  TT  AAAAAA . 

2/9 

172  bp 

.. 

. CACC  (TG)  25  TT  AAAAAA . 

2/9 

174  bp 

BP1E: 

... 

....CACC  (TG)  19TT  AAAAAA . 

in 

162  bp 

... 

....CACC  (TG)  2Q  TT  AAAAAA . 

5/7 

164  bp 

BPE1-17  neo: 

... 

....CACC  (TG)  lffTT  AAAAAA . 

2/10 

162  bp 

•• 

. CACC  (TG)  20  TT  AAAAAA . 

2/10 

164  bp 

. 

. CACC  (TG)  22  TT  AAAAAA . 

3/10 

168  bp 

. 

1/10 

170  bp 

. 

. CACC  (TG)  25  TT  AAAAAA . 

2/10 

174  bp 

Figure  6:  Differences  inMCFlOF, 
BP1E  and  BP1E-17  neo  in  the 
different  clones.  There  are 
different  numbers  of  TG 
repetitions  in  the  D1 7S796  region. 
Five  different  alleles  are  found  in 
MCF-10F  and  BP1E-17  neo,  and, 
two  alleles  in  BP1E.  The  number 
of  colonies  with  the 
corresponding  insert  and  the  total 
number  of  colonies  analyzed  in 
each  case  are  indicated  in  the  first 
column.  The  expected  size  of  the 
PCR  fragments  using  the  primers 
D17S796  Rev  and  D17S796 
Forward  are  indicated  at  the 
right  (bp:  base  pairs). 


We  have  performed  a  detailed  computational  analysis  of  the  fragment  with  25  TG  repetitions  isolated 
from  MCF-10F  and  BP  IE- 17  neo.  Sequence  translation  shown  a  putative  peptide  of  131  amino  acids 
(Figure  7).  The  predicted  amino  acid  sequence  does  not  seem  to  share  significant  homology  with  any 
known  protein. 


l  TITAGICC®.  TAATQOCAQ3  AIGIGOCTCT  TTCACAGTAG  OCTITA.TQCT  CIOCIGM3TA 
.SD  NAR  MCLF  HSS  LYA  LLS 

6i  CTTQGQSTAr  PCKTGCPCCF  GIGIGTGTGT  GICTGTGICT  GIGICTGTGT  GIGICT3IGT 
TWGI  HAP  VCVC  VCV  CVC  VCV 

i2i  GTCICTGTGT  TAAAAAATAC  TQCIOCAQGA  TTICAAGGBC  CACALTIOST  TOCATIGTIT 
CVCV  KKY  CSRI  SRT  TFG  SIV 

181  TCnTAAITG  TOQCAAQGTA  AGAASAAAG  AACIGCAATG  CGCATOGICT  GEA3ATGGAC 
FFNC  GKL  RRKE  LQC  ACS  VDG 

24i  ATOCITOOCA  TOGICMTAC  AOCA3A3GAG  AGIOQQS^C  AGCIOCTQCC  OCIGOCAQQC 
HASH  GHY  TRGE  SGT  APP  PAR 

301  TOOCAAGWC  CMGCTCCAA  TGICAAGOCT  ATGCIC ftCAG  GGAfflOOCIT  GAnTAftGIT 
LPRT  MLQ  CQPY  AHR  E  S  L  DLS 

361  TQ3TAAMOG  ATATOOCAIT  AAMTT ATGT  TGTIAATTIG  AATTITATTG  ATT 
LVKR  YPI  KFML  LI. 


Figure  7:  Translation  of  the 
sequence  present  in  MCF-10F 
and  BP1E-17  neo.  The  putative 
peptide  of  131  amino  acids 
obtained  from  the  sequence 
analyzes  is  shown.  The  first  codon 
of  this  putative  exon  (TCC)  that 
encodes  the  amino  acid  serine  (S) 
is  underlined 

Comparison  of  the  DNA 
sequences  of  many  different 
genes  revealed  certain 
similarities  at  the  intron-exon 
junctions  (53).  The  base 
sequence  of  the  intron 
usually  ends  with  AG.  The 
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3'  splice  site  sequence  of  most  introns  is  similar  to  the  sequence  (C/U)<ioN(C/T)AG/G  such  that  most 
introns  end  in  AG  after  a  long  stretch  of  pyrimidines  (53).  These  sites  appear  to  be  very  important  for 
introns  processing.  We  have  found  that  in  the  sequence  of  this  novel  protein,  the  bases  AG  are  present  at 
the  3’  putative  splice  site  (Figure  7).  In  mammalian  genes,  polyadenylation  sites  are  usually  preceded  by 
AATAAA  or  ATTAAA  ~20  bases  before  the  cleavage  site  and  followed  by  a  more  weakly  conserved 
GT-based  motif.  A  putative  polyadenylation  site  ATTAAA  and  GT  region  was  found  at  the  3’  end. 
These  data  suggest  that  this  peptide  could  be  the  last  exon  of  a  novel  protein.  This  peptide  forms  part  of  a 
larger  novel  protein  and  it  is  possible  that  also  others  part  of  this  protein  would  be  mutated  in  BP  IE. 

Sequences  analyzes  from  BP1E,  showed  that  translation  of  the  sequence  with  19  TG  repetitions  will  give 
a  putative  peptide  of  127  amino  acids  similar  to  the  one  found  in  BP1E  and  BP1E-17  neo  (Figure  8). 
These  peptides  differ  in  four  amino  acids:  VCVC  (V:  valine;  Cysteine).  The  translation  of  the  sequence 
with  20  TG  repetitions,  also  found  in  BP  IE,  have  a  non-sense  codon  (stop  codon)  and  a  shorter  peptide  of 

61  aminoacids. 


Figure  8:  Alignment 
between  the  putative 
peptides  from  BP1E  and 
the  putative  peptide  from 
MCF-10F  and  BP  IE-17 
neo .  The  peptide  alignment 
has  shown  that  four 
aminoacids  are  missing  in 
BP1E : 


BP1E  1  SDNARMCLFHSSLYALLSTWGIHAP. . .  .VCVCVCVCVCTCVKKYCSRIS  46 

iiiiiiiiiiimiiimimi  iiiiiiiiimmmm 

MCF-lOF and BP1E-17  neo  1  SDNARMCLFHSSLYALLSTOGIHAPVCVCVCVCVCVCVCVCVKKYCSRIS  50 
47  RTTFGSIVFFNCGKLRRKELQCACSVDGHASHGHYTRGESGTAPPPARLP  96 

iiiiimimmiiiiiiiiiimmiimimmiiiiiii 

51  RTTFGSIVFFNCGKLRRKELQCACSVDGHASHGHYTRGESGTAPPPARLP  100 
97  RTMLQCQPYAHRESLDLSLVKRYPIKFMLLI*  128 

iimiiiimiiiimimimmii 

101  RTMLQCQPYAHRESLDLSLVKRYPIKFMLLI*  132 


The  putative  peptide  was  compared  with  known  proteins  using  BLASTP.  The  predicted  peptide  was 
compared  with  a  known  proteins  using  BLASTP.  The  Swiss-Prot/  TrEMBL  primary  accession  numbers  are  shown 
(http://us.expasy.org/sprot).  The  putative  peptide  of  131  amino  acids  showed  very  low  homology  with 
known  proteins  (Table  1).  The  nuclear  transition  protein  2  (DNA-binding  protein)  shown  22.1%  similarity 
and  15.3%  identity. 


Table  1.  Homology  between  the  predicted  peptide  of  131  amino  acids  and  different  known  proteins. 


Protein 

%  Identity 

%  Similarity 

Accession 

numbers 

Transitional  protein  2  (Nuclear 
transition  protein  2) 

15.3% 

22.1% 

Q9N1A7 

Similar  to  expressed  sequence 
AW049250 

14.5  % 

22.1% 

Q8K116 

Histamine  H3  receptor  (HH3R) 

14.5% 

22.1% 

P58406 

Netrin  4  (P  netrin) 

11.5% 

17.6% 

Q9JI33 

Study  of  the  expression  of  the  region  that  contains  D1 7S79. 
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As  our  “in  silico ”  analyzes  has  shown  a  putative  novel  peptide  in  the  region  D17S796.  The  next  step  was 
to  use  RT-PCR  to  determine  if  there  is  expression  of  this  region  in  MCF-10F,  BP1E  and  BPlE-17neo 
cells.  For  this  purpose  total  RNA  was  extracted  from  these  cells  and  cDNA  was  synthesized  using 
reverse  transcriptase  and  oligo(dT).  The  cDNA  obtained  from  the  different  cell  lines  were  used  in  PCR 
reactions  with  specific  primers  for  the  putative  coding  region.  Primers  with  annealing  sites  within  the 
putative  coding  region  were  used  in  the  amplifications  and  PCR  products  would  be  expected  only  if  there 
is  mRNA  corresponding  to  this  region  (Figure  9).  A  fragment  of  340  bp  was  obtained  in  the  three  cell 
lines  when  the  primers  JR869  and  JR  870  were  used  (Figure  10).  A  fragment  of  approximately  285  bp 
was  obtained  when  the  primers  JR867  and  JR868  were  used  (Figure  10).  These  results  showed  that  the 
region  that  harbors  D17S796  is  transcribed  to  mRNA  in  MCF-10F,  BP1E  and  BP1E-17  neo. 

gacctctcctttctcaga . ccatgggtatgtgagcagctaggatcaaggg 

I  I  II  I  I  I  I  I  I  I  I  I  I  I  I  I  I  I  I  I  I  I  I  I  II  II  I  I  I  I  I  I  I  I  I  I  I  I  I  I  I  II  II  I 

GACCTCTCCTTTCTCAGACCCATGGGTATGTGAGCAGCTAGGATCAAGGG 


gcatgagatttagtcct  q  a  c  g  a  c  a  t  q  a.  t:  c  a  1 1  c  c  a  t  g  t  g  t  q  a  a  t  c  t  q  g  t:  t 

I  I  I  I  I  I  I  I  I  II  I  I  I  I  I  I  I  II  I  II  I  II  I  I  I  I  1  I  II  II  M  I  II  I  I  I  I  I  I  I  I  I 

GC AT  GAG AT  T  T AGT  CC  T  G  AC  GAG AT GAT CAT T  CC AT GT G T  GAAT CT  G GTT 


Putative  exon 

t  q  c  c  c  c  1 1 1  a  g  tccga  t  a  a  t  q  o  ca  q  q  a  t  q  t  gc  c  t  c  1 1:  t  ca  c  a  g  t  a  q  c  c  1 1 

I  II  I  I  I  I  I  II  I  II  I  I  I  II  II  II  II  II  II  II  I  I  I  I  I  I  I  II  I  II  I  II  I  I  I  II 

TGCCCCTTTAGTCCGATAATGCCAGGATGTGOCTCTTTCACAGTAGCCTT 

JR867 _ ^ 

t  a  t:  c  f  c  t  c  t  c  o  t  q  a  q  t  a  c  1 1  q  q  q  g  t  a  t  a  c  a  t  g  c  a  c  o .  .  t  g  t  q  t  g  t  q  t  g  t  g  t 

I  I  I  I  I  I  I  I  I  I  I  I  I  I  I  I  I  I  I  I  I  I  I  I  I  I  I  I  I  I  I  I  I  I  I  II  II  I  i  II  I  i  M  I 

TAT  G  CT  CT  CCT  GAGT  A  CTT  GG  GGT  AT  ACAT  GC  ACCT  G  T1  GT  G  T  GT  GT  GT  G  T 


JR8  69 


g  t  g  t  q  t  <  J  t  q  t  g  t  q  t  g  t  q  t  g  t  q  t  g  1 1  a  a  a  a  a  a  t  a  c  t  q  c  t:  c  c  a  q  g  a  t 1 1  c  a  a 

II II II II II II  I  I  II  II  II  II  I  II  M  I  I  I  I  I  I  I  M  I  I  I  II  I  II  M  I  I  II 

GTGTGTGTGTGTGTGTGTGTGTGTTAAAAAATACTGCTCCAGGATTTCAA. 


ggaccaoatttggttccattqttttctttaattgtggcaagctaagaaga 

1 1 1 1 1 1  ii  i  ii  m  1 1 1 1 ii i i 1 1 1 1 1 1 1 1 1 1  ii  1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

GGACCACATTTGGTTCCATTGTTTTCTTTAATTGTGGCAAGCTAAGAAGA 


aaagaactgcaatgcgcatgctctgtagatggacatgcttcccatggtca 
I  II  II  I  I  I  I  II  I  I  I  I  II  I  I  I  I  I  I  I  I  I  I  I  II  I  I  I  I  II  I  I  I  I  I  I  I  I  I  I  I  I  I  I 
AAAGAACTGCAATGCGCATGCTCTGTAGATGGACATGCTTCCCATGGTCA 


Figure  9:  Annealing  sites  of  the 
primers  used  in  RT-PCR 
experiments .  The  sequences  of 
AC005668  (upper  sequence)  and 
MCF-10F  (bottom  sequence)  are 
shown  with  the  corresponding 
sites  for  the  primers  used  in  the 
RT-PCR:  JR869 \  JR870 ,  JR867 
and  JR868.  Also  the  site  for 
JR864  is  shown .  The  beginning 
of  the  putative  exon,  the  stop 
codon  and  the  ATTAAA  at  the  3’ 
are  shown. 


ttacaccagaggagagtcgggcacagctcctccccctgccaggctcccaa 
II  I  I  I  I  I  I  I  I  II  I  I  I  I  II  I  II  I  II  I  I  II  I  I  I  I  I  I  I  I  II  I  I  I  I  I  I  I  I  I  I  I  I 
TTACACCAGAGGAGAGTCGGGCACAGCTCCTCCCCCTGCCAGGCTCCCAA 

_ JR8  68 _ 

gatccatgctccaatgtcaaccctatgctcacagggagagccttgattta 
II  I  I  I  I  II  I  I  I  I  I  I  II  I  I  I  I  I  I  I  II  II  I  I  I  I  II  I  I  I  I  II  I  I  II  I  I  I  I  I  I 

GAACCATGCTCCAATGTCAACCCTATGCTCACAGGGAGAGCCTTGATTTA 

Stop  codon 

agtttggtaaaacgatatcccattaaatttatgttgttaatttgaatttt 
I  I  I  I  I  II  I  I  I  !  I  I  I  I  I  I  I  I  I  I  I  I  I  I  I  I  I  I  I  I  I  I  I  I  I  I  I  II  II  I  I  I  I  I  I  I  I 
AGTTTGGTAAAACGATATCCCATTAAATTTATGTTGTTAATTTGAATTTT 

attgatttcgttgtttgccgttctataatttgtccatttgttttgctttt 
I  I  I  I  I  I  I  I  I  I  I  I  I  I  I  I  I  I  I  I  I  I  I  I  I  I  I  II  I  I  I  I  I  I  I  I  I  I  I  I  I  I  I  I  I  I  I  I  I 
ATTGATTTCGTTGTTTGCCGTTCTATAATTTGTCCATTTGTTTTGCTTTT 
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_ JR8  64 _ 

atggtcataggagagccatcagcagaaagattttactccagcttgcatgttg 

I  II  I  I  I  I  I  I  I  I  I  I  I  I  I  I  I  I  I  I!  I  I  II  I  I  I  I 

AT  GGT  C AT AG  G  AG  AGC  C  AT  C  AG  C AG AAAG A 
*  JR870 

gtacatatttaaatagcattataatagaaattagttgtgtcaacaaagagggagaagttctttgaaagtattttcatttcaaaagaag 
agccaagcattgctaggatttgagaaatattttatttaatgttgtatttcccacgagaggatcatgagcgctcaactcttcccaaccccagctg 
actcctccatgagaaatctgtcagtggttttcccagaaggctgcccccgtgtgaaattcaaagatatgtagtgtgttaaggccagggtgttgct 
ctccgaatgcctccaccaggaagaatctgggacccagcttctgcccagttacaaatgaatgggacaaagtcatgttcactggaaacttgcctga 
gaaaccatctgaggacgtcattgtgcgtccatcgttccgagaaaacatgagctgtgaaagaccacaggacagatgcagaggactcccctaaatt 
atctaccccagggagtgctggcatttctctctatccctgatgccagcaaatgcagccacgcccaagagattttttctttctttgtcccctacga 
gaagaagatatggagactatgtggtt  attaaagctcttagc 

Also  we  found  that  the  expressed-sequence  tag  EST  3179739  matches  a  region  that  is  located  120  bp 
downstream  of  the  cloned  region  (Figure  1:  bases  127541  to  127544).  The  EST  3179739  sequence  comes 
from  a  cDNA  library  from  lung  (tissue  type:  carcinoid).  Expressed  sequence  tags  (ESTs)  are  short  (200- 
500bp)  DNA  sequences  generated  from  the  3'  and  5'  ends  of  randomly  selected  cDNA  clones  and  the 
purpose  of  EST  sequencing  is  to  rapidly  scan  for  all  the  protein  coding  genes  and  to  provide  a  tag  for  each 
gene  on  the  genome.  A  99%  identity  was  found  between  both  sequences  using  Blast 
('www.ncbi.nlm.nih.gov/blast/Blast.cgiT 

These  results  strongly  suggest  that  we  have  cloned  the  last  exon  of  a  novel  protein. 


m  a  Kile’ 

1A  2A  3A  a  IB  2B  3B  "  Ladder 


Negative  controls 


Figure  10:  Study  of  the 
expression  of  the  D1 7S796  region 
in  MCF-10F,  BP1E  and  BP1E-1 7 
neo  by  R  T-PCR  experiments . 
Upper  gel:  RNA  from  MCF-10F 
(1),  BP1E  (2)  and  BPlE-17neo 
(3)  were  used  for  reverse 
transcription .  The  cDNA  obtained 
were  used  in  PCR  reactions .  The 
primers  JR  869  and  JR870  were 
use  in  some  reactions  (A).  In 
another  PCR  reaction,  the 
primers  JR867  and  JR868  were 
used  (B).  Fragments  of 340bp  and 
285  bp  were  obtained  respectively. 
Bottom  gel:  RNAs  from  the 
different  cells  were  used  directly 
in  the  PCR  reactions  as  control  of 
DNA  contamination  without 
reverse  transcription.  Also, 
reagents  controls  for  both  PCR 
reactions  were  made  (Blank  A 
and  Blank  B).  A  lOObp  ladder 
was  run  at  the  right 
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B-ii-c-Next  proposed  plan  of  research 

We  are  planning  to  continue  with  the  aims  proposed  using  the  following  strategies: 

1- The  full-length  cDNA  corresponding  to  this  novel  gene  will  be  isolated  using  “rapid  amplification  of 
cDNA  ends”  (RACE).  RACE  is  a  procedure  for  obtaining  full-length  cDNA  copies  of  low  abundance 
mRNAs.  This  technique  achieves  amplification  of  the  region  between  a  single  short  sequence  in  a  cDNA 
and  its  unknown  3’  or  5’  end  (51).  The  3 ’RACE  takes  advantage  of  the  natural  poly(A)  tail  found  in 
mRNA  as  priming  site  for  PCR.  The  mRNAs  are  converted  into  cDNA  using  reverse  transcriptase  (RT) 
and  an  oligo-dT  adapter  primer.  Specific  cDNA  is  then  amplified  by  PCR  using  a  gene-specific  primer 
(GSP)  that  anneals  to  a  region  of  known  exon  sequences  and  an  adapter  primer  that  targets  the  poly(A) 
tail  region.  This  permits  the  capture  of  the  3’ -mRNA  sequences  located  between  the  exon  and  the 
poly(A)  tail.  The  5’  RACE  uses  an  antisense  gene  specific  primer  for  the  synthesis  of  specific  cDNA  by 
reverse  transcriptase.  Prior  to  PCR,  a  TdT-tailing  step  attaches  an  adapter  sequence  to  the  unknown 
5 ’sequences  of  the  cDNA.  Specific  cDNA  is  then  amplified  by  PCR  using  a  GSP  that  anneals  in  a  region 
of  known  exon  sequences  and  an  adapter  primer  that  targets  the  5’  terminus.  Products  generated  by  the  3’ 
and  5’  RACE  procedures  may  be  combined  to  generated  full-length  cDNAs.  RACE  has  been  used  for 
amplification  and  cloning  of  rare  mRNAs  (51).  The  full-length  cDNA  will  be  cloned  in  stable  expression 
vector  such  as  pSV2neo  and  transfection  experiments  will  be  performed. 

2- We  have  hypothesized  that  this  novel  protein  could  be  one  of  the  proteins  in  the  caspase  cascade  of 
apoptosis  through  CD95/Fas  signaling.  Northern  blot  analysis  will  be  performed  to  study  if  apoptotic 
agents  like  human  chorionic  gonadotropin  (hCG)  induces  an  elevation  of  the  expression  of  this  novel 
gene.  Our  previous  findings  showed  that  hCG  induces  an  elevation  of  the  expression  of  apoptotic  genes 
like  Trpm2,  ICE,  TGF-(3,  p53  and  p21  in  MCF-10F  (52).  Using  Northern  blot  analysis,  information  such 

Figure  11.  Chromosomal 
localization  of  D17S796  marker  in 
chromosome  17.  The  position  of  the 
D17S796  in  the  chromosome  17  is 
shown.  Some  of  the  genes  near 
D17S796  are  described  like  the 
arylhydrocarbon  receptor 

interactiong  protein-  like  and  the 
death  effector  filament-forming 
Ced-4-like  apoptosis  protein 
(DEFCAP). 


3 -A  detailed  analyses  will  be 
done  to  study  mutations  in 
other  known  genes  localized  at 
17pl3. 1-13.2  near  the 
D17S796  marker.  The  known 
genes  located  in  this  region  are 
DEFCAP,  KIAA0523,  Aryl 
hydrocarbon  receptor  interacting  protein-likel  and  the  hypothetical  protein  FLJ10156  (Figure  11). 
Different  primers  will  be  used  to  analyze  putative  mutations  in  them  using  PCR-  SSCP  analyses  (Table  2). 


as  the  size  of  the  mRNA  and  expression  level  will  be  obtained. 


m?$93S—  B17S796— 


Legend: 

-  bovn4*rifis  **  re^ico'SHF  -  otfver  feature 

r - 1  -  sequence  f racier. t  sheen 

1.  Arylhydrocarbon  receptor  interacting  protein  -like/  Hypotetical  protein  FLJ10156 

2.  LOC  255868 

3.  KIAA0523 

5.  Ced  4 -like  apoptosis  protein 
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These  studies  will  help  to  determinate  regions  near  D17S796  marker  that  could  also  be  mutated  during  the 
transformation  of  BP1E.  Genomic  DNA  from  MCF-10F,  BP1E  and  BP1E-17  neo  cells  will  be  used.  The 
availability  of  different  markers  on  chromosome  17  will  allow  us  to  perform  a  detailed  analysis  for 
locating  regions  that  have  been  retained  after  transfer  of  this  chromosome  to  the  cells  of  interest.  This 
analysis  will  define  regions  near  D17S796  that  could  also  change.  The  sequences  and  characteristics  of 
microsatellite  oligonucleotide  primers  were  obtained  from  GDB  database  ('http://www.gdb.org)  and 
Cooperative  Human  Linkage  Center  Web  site  (http://www.chlc.org).  These  experiments  will  allow  us  to 
determinate  all  the  gene(s)  involved  in  the  reversion  of  the  phenotype  in  BP1-17  neo. 


Table  2:  Details  of  the  primers  that  will  be  used  in  LOH  mapping  at  1 7pl3.1-13.2. 


Locus 

Position 

Size  range 

Primer  sequence  5’ — 3' 

Locus  description 

RH121127 

17p13 

273  bp 

Forw:  AAT GTT CCACAGAGCCAT GAAGT 

Rev:  TCCATTGTAGAAMCCTGGTCGT 

Death  effector  filament-forming 
Ced-4-like  apoptosis  protein 
(DEFCAP) 

SHGC-1 48247 

1 7p1 3 

307  bp 

Forw:  T  G  ACTT  CTTCCAGTGT  CTT  G  CAG 

Rev:  AACT CCTT C AG CTCCT CCTT CTT 

Death  effector  filament-forming 
Ced-4-like  apoptosis  protein 
(DEFCAP) 

Cdal bh06 

1 7p1 3.2 

110  bp 

Forw:  T CTTT CT GCCT CT CAACT CT C 

Rev:  AAGGTAGTAGTATTCCTGGGTC 

KIAA0523  protein 

D17S938 

164-182  bp 

Forw:  CCGGATT GCT ACACCT AAAT 

Rev:  AAC AGT CT CTNCT GG AGCAG 

D17S1579 

Forw:  CG AT CCAAACCT CT CCT CAG 

Rev:  TCTCCTCTGAATGCTAAGACTGC 

G42173 

Forw:  GAGAGCT CTAAGT GT CCT GC 

Rev:  TTT  CT  CAT  GCCT  GAT  ACCCC 

Aryl  hydrocarbon  receptor 
interacting  protein-likel  (AIPL1) 

RH 104452 

Forw:  AGGGCAGAAATTGCTCTTGA 

Rev:  TTT  CCCT  ACCAG  G  G  ACT  CG 

Hypothetical  protein  FLJ10156 

D17S260 

147  bp 

Forw:  AAT GGCT CCAAAAGGAGAT ATT G 

Rev:  CT  CCCCAACAT  GCTTT  CT  CT  C 

D17S2057 

260-261  bp 

Forw:  TT  GTTT  GT  CAAACACTT  CCT  GG 

Rev:  TGCAAACTTCAAGAGGAGGG 

D17S600 

100-990  bp 

Forw:  GCTTTCCCATTACTGCCCAGTTC 

Rev: 

CT  CT  CAT  CACTGCT  CCACG  ATT  AT  AC 

WIAF-1598 

17p13.2 

186-189  bp 

Forw:  CCT  ATT  AATT  CAAGT  GT  AC  AC  AT 

Rev:  CT  CT  GAAGT  ACAAGAATTT  CAG 

KIAA0753  gene  product 

D17S1881 

216-230  bp 

Forw:  CCCAGTTT AAGG AGTTT GGC 

Rev:  TAGGGCAGTCAGCCTTGTG 

SHGC-32950 

17p13.2 

125  bp 

Forw:  CGT GCACT CCT CCAGT AGC 

Rev:  GT  AGGCG  AAAATGGT  CTT  GC 

Hypothetical  protein  MCG14353 
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RH91775 

17p13.2 

128  bp 

Forw:  MCAGCAGATTCTACATTGGCA 

CGI-125  protein 

Rev:  TTAAGTG CCT CGTTT GTATCC A 

C-  KEY  RESEARCH  ACCOMPLISHMENTS: 

We  found  a  novel  gene  in  the  region  D17S796.  This  gene  is  located  at  approximately  200  Kb  of 
the  KIAA0523  gene  and  78  Kb  of  AIPL1  (aryl  hydrocarbon  receptor  interacting  protein-like  1). 

-  We  have  cloned  the  last  exon  of  this  novel  gene  and  the  sequence  of  this  fragment  will  be  used  to 
select  the  full-  length  cDNA. 

-  RT-PCR  experiments  shown  that  this  gene  is  expressed  in  MCF-10F,  BP  IE  and  BP1E-17  neo  and 
the  EST  3179739  matched  the  3’  end  of  this  gene. 

We  found  difference  in  the  region  D17S796  between  MCF10F,  BP1E-17  neo  and  BP1E.  The 
difference  is  in  the  number  of  the  TG  repeats  in  this  region.  We  found  that  MCF10F  and  BP  IE- 17 
neo  has  one  allele  in  which  the  D17S796  region  is  174  bp  although,  BP  IE  shown  a  shorter  region. 


D-REPORTABLE  OUTCOMES 

No  publications  have  been  originated  from  these  data. 

E-CONCLUSIONS 

Loss  of  genetic  material  (LOH)  in  the  chromosome  17  pl3.2  at  the  micro  satellite  marker  D17S796  has 
been  identified  in  hepatocellular  carcinoma  (54),  atypical  ductal  hyperplasia  and  in  situ  ductal  carcinoma 
of  breast  (39,  55).  Our  results  shown  LOH  at  the  same  region  in  MCF-10F  cells  treated  with  the  chemical 
carcinogen  benz(a)pyrene  (BP).  Moreover,  microcell-mediated  transfer  of  an  intact  copy  of  human 
chromosome  17  inhibits  the  transformation  phenotype  of  BP  IE  cells  and  PCR-SSCP  analyzes  show  a 
restoration  of  the  lost  material  in  BP1E-17  neo.  These  experiments  suggest  the  presence  of  a  TSG  in 
1 7pl 3.2  near  the  marker  D17S796.  For  this  purpose  a  940  bp  region  has  been  amplified  and  cloned.  The 
BP  IE  cell  has  lost  10-12  bases  consisting  in  a  TG  repetition.  RT-PCR  experiments  have  shown  that  this 
region  is  expressed  in  MCF-10F,  BP  IE  and  BP  IE- 17  neo.  Also  we  have  found  that  the  expressed- 
sequence  tag  EST  3179739  matches  with  the  3’end  of  this  region.  Computational  analyses  show  that  the 
cloned  fragment  could  be  the  last  exon  of  a  bigger  peptide.  The  presence  of  a  3’  splicing  site  in  the 
putative  introns  and  the  ATT  AAA  region  at  the  3’end  support  this  idea.  The  predicted  amino  acid 
sequence  does  not  share  significant  homology  with  any  known  protein  supporting  the  idea  that  this  could 
be  a  novel  protein.  Further  experiments  will  be  done  in  order  to  clone  the  full-length  cDNA  and  to  study 
the  regulation  of  the  expression  of  this  novel  gene. 
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